Design of Document Profile Database for Browsing in Information Retrieval Systems

Author

  • Jae-Woo LEE
Abstract

Information retrieval is one of the most important technologies at present. Information is readily available on the Internet and in distributed computing systems through various information retrieval models. To find the information we actually need, efficient information retrieval agent systems must be built to serve requests from many web clients. In this paper, we propose a simple new model for information retrieval agents based on the distribution of terms or keywords in a document or distributed database. Key paragraphs are extracted using the frequency of meaningful terms and the distribution characteristics of keywords in a document; the meaningful terms are selected by stemming, stop-list filtering, and synonym handling. The agent receives a web client's information retrieval request and uses the client's keywords to extract the key paragraph by frequency and distribution. It then constructs a profile of each document, containing the keywords, the key paragraph, and its location address in the document, for document browsing. Using this profile, many documents or pieces of knowledge can be searched easily and the documents browsed.
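
The pipeline described in the abstract can be illustrated with a minimal Python sketch. It is not the author's implementation: the stop-list, the synonym table, the crude suffix-stripping `stem` function, the `DocumentProfile` fields, and the scoring rule (keyword frequency multiplied by the number of distinct keywords present, as a simple stand-in for "frequency and distribution") are all assumptions made for illustration. The "location address" is represented here as a paragraph index.

```python
import re
from collections import Counter
from dataclasses import dataclass
from typing import Optional

# Hypothetical, tiny stop-list and synonym table; real ones would be far larger.
STOP_WORDS = {"a", "an", "and", "are", "for", "in", "is", "of", "on", "the", "to", "with"}
SYNONYMS = {"search": "retrieval"}


def stem(word: str) -> str:
    """Crude suffix stripping, a stand-in for a real stemmer."""
    for suffix in ("ing", "es", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def normalize(text: str) -> list:
    """Lowercase, tokenize, drop stop words, stem, then map synonyms."""
    terms = []
    for token in re.findall(r"[a-z]+", text.lower()):
        if token in STOP_WORDS:
            continue
        stemmed = stem(token)
        terms.append(SYNONYMS.get(stemmed, stemmed))
    return terms


@dataclass
class DocumentProfile:
    keywords: list      # normalized query keywords
    key_paragraph: str  # text of the best-scoring paragraph
    location: int       # index of that paragraph in the document (assumed address form)


def build_profile(document: str, query: str) -> Optional[DocumentProfile]:
    """Pick the paragraph that best matches the query keywords and wrap it in a profile."""
    keywords = list(dict.fromkeys(normalize(query)))  # de-duplicate, keep order
    paragraphs = [p.strip() for p in document.split("\n\n") if p.strip()]

    best_index, best_score = -1, 0
    for index, paragraph in enumerate(paragraphs):
        counts = Counter(normalize(paragraph))
        frequency = sum(counts[k] for k in keywords)         # total keyword hits
        coverage = sum(1 for k in keywords if counts[k] > 0)  # distinct keywords present
        score = frequency * coverage                          # assumed scoring rule
        if score > best_score:
            best_index, best_score = index, score

    if best_index < 0:
        return None  # no paragraph contains any keyword
    return DocumentProfile(keywords, paragraphs[best_index], best_index)
```

A browsing front end could then store one such profile per document and jump straight to `location` when a client selects a result, instead of rescanning the full text.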

Similar resources

Improved Skips for Faster Postings List Intersection

Information retrieval can be achieved through computerized processes that generate a list of relevant responses to a query. The document processor, matching function, and query analyzer are the main components of an information retrieval system. Document retrieval systems are fundamentally based on Boolean, vector-space, probabilistic, and language models. In this paper, a new methodology for mat...

Investigating the Impact of Authors’ Rank in Bibliographic Networks on Expertise Retrieval

Background and Aim: This research investigates the impact of authors' rank in bibliographic networks on the document-centered model of expertise retrieval. Its purpose is to find out what kind of author ranking in bibliographic networks can improve the performance of the document-centered model. Methodology: The current research is experimental. To operationalize research goals, a new test colle...

Incremental Development of Browsing for Domain-Specific Document Retrieval Systems

Browsing is supported in many information retrieval systems to supplement Boolean querying. We have implemented a web-based browsing mechanism for a domain-specific document retrieval system based on the concept lattice of Formal Concept Analysis. In this paper, we propose and implement an incremental development of browsing by combining Formal Concept Analysis (FCA) and Ripple Down...

A Gray Code Based Ordering for Documents on Shelves: Classification for Browsing and Retrieval Journal of the American Society for Information

A document classifier places documents together in a linear arrangement for browsing or high speed access by human or computerized information retrieval systems. Requirements for document classification and browsing systems are developed from similarity measures, distance measures, and the notion of subject aboutness. A requirement that documents be arranged in decreasing order of similarity as...

Publication date: 2007